Randomization in Parallel Stringology
نویسنده
چکیده
In this abstract, we provide an overview of our survey of randomized techniques for exploiting the parallelism in string matching problems. Broadly, the study of string matching falls into two categories: standard stringology and nonstandard stringology. Standard Stringology concerns the study of various exact matching problems. The fundamental problem here is the basic string matching problem where given a pattern string p 2 m and a text string t 2 n, the problem is to nd all occurrences i in t where the pattern occurs, that is, t[i i+m 1] = p[1 m]. Other well-studied problems within standard stringology include multiple pattern matching, dictionary matching, text indexing, etc. Non-standard Stringology concerns the study of various inexact matching problems. A fundamental problem here is string matching with wildcards where the problem, as before, is to nd all occurrences of the pattern within the text; however, some positions in the text and the pattern are marked as \wildcards"(denoted ), that is, these positions match whatever be the symbol in the string they align against. Formally, given p occurs in t at i if for all 1 j m, t[i+j 1] = p[j] if p[j] 6= and t[i+j 1] 6= . Another important problem here is the k-mismatches problem in which the goal is to determine all those positions in the text where the pattern occurs with at most k mismatches amongst the aligned symbols. Other problems here include subset matching, range matching etc. Stringology includes study of many other problems involving
منابع مشابه
Arithmetic Coding in Parallel
We present a cost optimal parallel algorithm for the computation of arithmetic coding. We solve the problem in O(log n) time using n/log n processors on EREW PRAM. This leads to O(n) total cost.
متن کاملRe-randomization increased recruitment and provided similar treatment estimates as parallel designs in trials of febrile neutropenia.
BACKGROUND Re-randomization trials allow patients to be re-enrolled for multiple treatment episodes. However, it remains uncertain to what extent re-randomization improves recruitment compared to parallel group designs, or whether treatment estimates might be affected. METHODS We evaluated trials included in a recent Cochrane review of granulocyte colony-stimulating factors for patients with ...
متن کاملUsing alignment for multilingual text compression
Multilingual text compression exploits the existence of the same text in several languages to compress the second and subsequent copies by reference to the first. We explore the details of this framework and present experimental results for parallel English and French texts.
متن کاملCompressing Bi-Level Images by Block Matching on a Tree Architecture
A work-optimal O(log M log n) time parallel implementation of lossless image compression by block matching of bi-level images is shown on a full binary tree architecture under some realistic assumptions, where n is the size of the image and M is the maximum size of the match. Decompression on this architecture is also possible with the same parallel computational complexity. Such implementation...
متن کاملAn Efficient Parallel Determinisation Algorithm for Finite-state Automata
Determinisation of non-deterministic finite automata (NFA) is an important operation not only for optimisation purposes, but also the prerequisite for the complementation operation, which in turn is necessary for creating robust pattern matchers, for example in string replacement and robust parsing. In the paper, we present an efficient parallel determinisation algorithm based on a message-pass...
متن کامل